Language Model and Speaking Rate Adaptation for Spontaneous Presentation Speech Recognition
Similar resources
Unsupervised class-based language model adaptation for spontaneous speech recognition
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for spontaneous speech recognition. The word classes are automatically determined by maximizing the average mutual information between the classes using a training set. A class-based language model is built based on recognition hypotheses obtained using a general word-based language model, and linearly...
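The clustering criterion mentioned in this abstract, the average mutual information between classes, can be illustrated with a small sketch. It assumes the common Brown-style formulation over the classes of adjacent tokens; the listing does not say whether the paper uses exactly this form. The function name `average_mutual_information` and the `word_to_class` mapping are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def average_mutual_information(tokens, word_to_class):
    """Average mutual information (in bits) between the classes of adjacent tokens.

    Assumption: Brown-style objective
        AMI = sum over (c1, c2) of p(c1, c2) * log2(p(c1, c2) / (p(c1) * p(c2)))
    estimated from class-bigram counts in the training text.
    """
    classes = [word_to_class[w] for w in tokens]
    bigrams = Counter(zip(classes, classes[1:]))
    total = sum(bigrams.values())
    if total == 0:
        return 0.0

    # Marginal counts of the left and right class in each bigram.
    left, right = Counter(), Counter()
    for (c1, c2), n in bigrams.items():
        left[c1] += n
        right[c2] += n

    ami = 0.0
    for (c1, c2), n in bigrams.items():
        p12 = n / total
        # p12 / (p1 * p2) simplifies to n * total / (left[c1] * right[c2]).
        ami += p12 * math.log2(n * total / (left[c1] * right[c2]))
    return ami


tokens = ["a", "b", "a", "b", "a"]
alternating = average_mutual_information(tokens, {"a": 0, "b": 1})
merged = average_mutual_information(tokens, {"a": 0, "b": 0})
```

A clustering procedure would compare such scores across candidate class assignments and keep the assignment with the higher value. Here the two-class assignment scores higher than the single merged class, because knowing one class fully predicts the next.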
Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition
The paper addresses large-vocabulary spontaneous speech recognition, focusing on acoustic modeling that takes the speaking rate into account. Using the real lecture speech corpus collected under the priority research project in Japan, we built a baseline acoustic model and evaluated it on the automatic transcription of oral presentations by experienced speakers, obtaining a word accuracy of 58.2%. Compar...
Frame-period adaptation for speaking rate robust speech recognition
This paper describes a frame-period adaptation method for speaking-rate-robust speech recognition. The proposed method determines an appropriate frame period for each phrase, either by measuring its speaking rate or by computing the acoustic likelihood with a set of frame periods. Experimental results on spontaneous speech recognition show that the proposed method is effective for slower utterances. Actual...
Unsupervised language model adaptation methods for spontaneous speech
In this paper we examine the performance of three unsupervised language model adaptation schemes applied to speech recognition of spontaneous lecture presentations. Two of the schemes have been described previously in the literature, while the third is a variation of one of the other two. All three schemes are based on a combination of word n-gram and class n-gram models a...
Dynamic language model adaptation using presentation slides for lecture speech recognition
We propose a dynamic language model adaptation method that uses the temporal information from lecture slides for lecture speech recognition. The proposed method consists of two steps. First, the language model is adapted with the text information extracted from all the slides of a given lecture. Next, the text information of a given slide is extracted based on temporal information and used for ...
Journal
Journal title: IEEE Transactions on Speech and Audio Processing
Year: 2004
ISSN: 1063-6676
DOI: 10.1109/tsa.2004.828641